Nt) . H Hi^rt'dP Prote.-.ses and Polynucleotides En< <: lin-; The S*ime 



Query= SEQ ID NO : 3 

{1047 letters) 

Score E 

Sequences producing significant alignments: (bits) Value 

AC008121. 33. 67777. 200926 ^ e-153 

>AC008121.3 3 . 67777 .2 0092 6 
Length - 133150 

Score = 546 bits (275), Expect - e-153 
Identities = 275/275 (100%) 
Strand = Plus / Plus 

Query: 382 agcgatcctttaatgtggacagctgtgattggaactaataatatacatggacgctatcct 441 

I IIMIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIII 

Sbjct: 51038 agcgatcctttaatgtggacagctgtgattggaactaataatatacatggacgctatcct 51097 
Query: 442 cataccaagaagataaaaattaaagcaatcattattcatccaaacttcattttggaatct 5 01 

IIMMMIIIIIIIIIIMIIIIIMMMIMMMIMIMIIIIIIIMMIIIII 

Sbjct: 51098 cataccaagaagataaaaattaaagcaatcattattcatccaaacttcattttggaatct 51157 
Query: 5 02 tatgtaaatgatattgcactttttcacttaaaaaaagcagtgaggtataatgactatatt 5 61 

IIIMIIIIMIIIIIIIIIIIIIIMIIMIIIMMIIIIIIIIIIIIIIIIIIilll 

Sbjct: 51158 ta tgtaaatgatat tgcac tt t ttcacttaaaaaaagcagtgaggtataatgac tatatt 51217 
Query: 5 62 cagcctat ttgcctaccttttgatgttttccaaatcctggacggaaacacaaagtgtttt 621 

IIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIM 

Sbjct: 51218 cagcctat ttgcctaccttttgatgttttccaaatcctggacggaaacacaaagtgtttt 51277 



Query: 622 ataagtggc tggggaagaacaaaagaagaaggtaa 656 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Sbjct: 51278 ataagtggc tggggaagaacaaaagaagaaggtaa 51312 



Score - 502 bits (253), Expect - e-139 
Identities = 253/253 (100%) 
Strand ^ Plus / Plus 



Query: 7 95 gggtgacagtgggggaccattaatgtgctacttaccagaatataaaagattt tttgtaat 854 



Query 915 gccatcct tctaccaaaagtggctgacagagcat ttcttccatgcaagcactcaaggcat 974 

IIIMIIIIIIIIMMIIIIIIIIMIIIIMIIIIIIIMIIMIIIIIIIIIIIIII 

Sbjct: 79708 gccatcct tctaccaaaagtggctgacagagcatttcttccatgcaagcactcaaggcat 79767 
Query 97 5 acttactataaatattttacgtggccagatcctcatagctttatgt tttgtcatc ttact 1034 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIMIII 

Sbjct: 79768 acttactataaatattttacgtggccagatcctcatagctttatgttttgtcatc ttact 79827 

Query: 1035 agcaacaacataa 1047 
Sbjct: 79828 agcaacaacataa 79840 



Score = 387 bits (195), Expect - e-105 
Identities = 196/197 (99%) 
Strand = Plus / Plus 

Query: 187 gattgtggaacagcaccgct taaggatgtgttgcaagggtctcggattatagggggcacc 246 

III IIIIIIIIIIIIIIIIIIMIIIIIIIIIM I II 

Sbjct: 36091 gattgtggaacagcaccgcttaaggatgtgttgcaagggtctcggattatagggggcacc 36150 
Query: 247 gaagcacaagctggcgcatggccgtgggtggtgagcctgcagattaaatatggccgtgtt 3 06 

MIIIIIIMII IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMMMI 

Sbjct: 36151 gaagcacaagctggcgcatggccgtgggtggtgagcctgcagattaaatatggccgtgtt 36210 
Query: 3 07 cttgttcatgtatgtgggggaaccctagtgagagagaggtgggtcctcacagctgcccac 3 66 

lllllll MM MMMMMMMMMMMI I 

Sbjct: 36211 cttgttcatgtatgtgggggaaccc tagtgagagagaggtgggtcctcacagctgcccac 36270 
Query: 367 tgcactaaagacrctag 383 

MMIIIMIII INI 

Sbjct: 36271 tgcactaaagacac tag 36287 



Score = 363 bits (183). Expect - 9e-98 
Identities - 185/187 (98%) 
Strand = Plus / Plus 

Query : 1 atgcggctggggctcctgagcgtggcgytgttgtt tgtggggagctctcacttayactca 60 



):;-;- cKiccac-rar^r cocc(--c:c:9aaagcacagcTctcggcccc'- cgccggaaccggcggcr ag: ib':> . 



Query: 121 tcccagcaggctgaggccgtccgcaagaggctccggcggcggagggagggaggggcgcat 18 0 

IIIIIIIIIIIIIIIIMIMIIIIIIMIIIMIMIMIIIMIMMIIIIIIINI 

Sbjct: 35335 tcccagcaggc tgaggccgtccgcaagaggc tccggcggcggagggagggaggggcgca t 35394 

Query: 181 gcaaagg 187 
lllllll 

Sbjct: 35395 gcaaagg 35401 



Score = 290 bits (146), Expect = le-75 
Identities = 146/146 (100%) 
Strand = Plus / Plus 

Query: 651 aggtaacgctacaaatattttacaagatgcagaagtgcattatatttctcgagagatgtg 710 

III I Ml MM I I I 

Sbjct: 77571 aggtaacgc tacaaatatt ttacaagatgcagaagtgcat tatat ttc tcgagagatgtg 77630 
Query: 711 taattctgagaggagttatgggggaataattcctaacacttcattttgtgcaggtgatga 770 

II I II I I I I 

Sbjct: 77631 taattctgagaggagttatgggggaataattcctaacacttcattttgtgcaggtgatga 77690 
Query: 771 agatggagcttt tgatact tgcaggg 796 

IIIIIMMIIIIIIIIIIIIIIMI 

Sbjct: 77691 agatggagcttt tgatacttgcaggg 77716 
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r 1: AC008121. Homo sapiens 12 B...[gi:28626577] 



Links 
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DEFINITION 

ACCESSION 
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REFERENCE 
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Euteleostomi ; 
Homo . 



AC008121 105989 bp DNA linear PRI Ol-MAR-2003 

Homo sapiens 12 BAC RP11-407N8 (Roswell Park Cancer Institute Human 
BAC Library) complete sequence. 
AC008121 

AC00S121.43 GI:2S626577 
HTG. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 
1 (bases 1 to 105989) 

Muzny.D.M., Adams, C, Adio-Oduola , B . , Ali-osman, F . R . , Allen, C, 
Alsbrooks, S.L. , Amaratunge , H . C . , Are.J.R., Ayele,M., Banks , T . , 
Barbaria^J., Benton, J., Bimage,K., Blankenburg , K . , Bonnin,D., 
Bouck,J., Bowie, S., Brieva.M., Brown, E., Brown, M., Bryant, N. P., 
Buhay,C., Burch,P., Burkett,C., Burrell , K . L . , Byrd,N.C., 
Carron,T.F., Carter, M., Cavazos , S , R. , Chacko.J., Chavez, D., 
Chen,G., Chen,R., Chen,Z., Chiu,D., Chowdhry, I . , Christopoulos , C . 
Cleveland, C. D. , Cox,C., Coyle,M.D., Dathorne , S . R . , David, R., 
Davila,M.L. , Davis, C. , Davy-Carrol 1 , L . , Dederich, D . A . , 
Delaney,K.R. , Delgado,0., Denn,A.L., Ding,Y., Dinh,H.H., 
Douthwaite, K. J. , Draper, H. , Dugan-Rocha , S . , Durbin,K.J . , 
Earnhart,C., Edgar, D,, Edwards , C . C . , Elhaj,C., Emerling,S., 
Escotto,M., Falls, T., Ferraguto , D . , Flagg,N., Ford, J., Foster, P., 
Frantz,P., Gabisi,A., Gao,J., Garcia, A., Garner, T., Garza, N., 
Gill,R., Gorrell , J.H. , Guevara, W., Gunaratne , P . , Hale,S., 
Hamilton,?:., Han, J., Harris, C, Harris, K., Hart,M., Havlak,P., 
Hawes,A., Hernandez , J . , Hernandez , 0 . , Hodgson, A., Hogues,M., 
Holloway,C., Hollins,B., Homsi,F., Howard, S., Huber,J., Hulyk,S., 



loshikhes, I . , Jackson , L . E . , Jacobson,B., Jia,Y., 
R., Jolivet,S., Joudah,S., Karlsson,E., Kelly, S., Khan,U. 
Korvah,J., Kovar,C., Kratovic,J., Kureshi,A., Landry, N., 
Lee,E., Lewis, L.C., Lewis, L,, Li, J., Li,Z., Lichtarge,0. 
Liu, J., Liu,W., Loulseged, H . , Lozado,R.J., Lu,X., 
Lucier,A., Lucier,R., Luna,R., Ma , J . , Maheshwari , M . , Mapua,P,, 
Marondel,!., Martin, R., Mart indale , A . , Martinez, E., Massey,E., 
Mawhiney,E., McLeod.M.P., Meador,M., Mei,G., Merscher,S., 



Hume , J . 
Johnson 
King, L . 
Leal,B. 
Lieu, C . 



Peters , 1. . 
Rives , M . , 



Rol f c, 



Qu 1 1 , 



Savery , G 



hitp://ww\v.ncbi.nlm.nih.gov/cntrc/yqucryJcg]\'cnul=Rctrieve&db=nuclcotidc&I 5/6/2()(J3 



>XM_171629 ACCESSION:XM_171629 NID: gi 22062231 ref XM_171629.1 Homo 
sapiens similar to cortical granule serine protease 1 
precursor (LOC257238) , mRNA 
Length = 1295 



Identities = 141/153 (92%), Positives = 141/153 (92%) 
Frame = +3 

Query: 62 CGTAPLKDVLQGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRERWVLTAAHC 121 

CGTAPLKDVLQGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRE 
Sbjct: 3 CGTAPLKDVLQGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRE 155 

Query: 122 TKDSDPLMWTAVIGTNNIHGRYPHTKKIKIKAIIIHPNFILESYVNDIALFHLKKAVRYN 181 

SDPLMWTAVIGTNNIHGRYPHTKKIKIKAIIIHPNFILESYVNDIALFHLKKAVRYN 
Sbjct: 156 SDPLMWTAVIGTNNIHGRYPHTKKIKIKAIIIHPNFILESYVNDIALFHLKKAVRYN 326 

Query: 182 DYIQPICLPFDVFQILDGNTKCFISGWGRTKEE 214 

DYIQPICLPFDVFQILDGNTKCFISGWGRTKEE 
Sbjct: 327 DYIQPICLPFDVFQILDGNTKCFISGWGRTKEE 427 



Identities = 131/131 (100%), Positives = 131/131 (100%) 
Frame = +3 

Query: 215 GNATNILQDAEVHYISREMCNSERSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLP 274 

GNATNILQDAEVHYISREMCNSERSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLP 
Sbjct : 900 GNATNILQDAEVHYISREMCNSERSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLP 1079 

Query : 275 EYKRFFVMGITS YGHGCGRRGFPGVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILI 334 

EYKRFFVMGITSYGHGCGRRGFPGVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILI 
Sbjct: 1080 EYKRFFVMGITSYGHGCGRRGFPGVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILI 1259 

Query: 335 ALCFVILLATT 345 

ALCFVILLATT 
Sbjct: 1260 ALCFVILLATT 1292 
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FEATURES 

source 



gene 



CDS 



LOC257238 1295 bp mRNA linear PRI Ol-AUG-2002 

Homo sapiens similar to cortical granule serine protease 1 
precursor (LOC257238) , mRNA. 
XM_171629 

XM_171b^y . 1 GI:220 62231 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Marmalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1295) 
NCBI Annotation Project. 
Direct Submission 

Submitted (31-JUL-2002) National Center for Biotechnology 
Information, NIH, Bethesda, MD 20894, USA 

GENOME ANNOTATION REFSEQ : This model reference sequence was 
predicted from NCBI contig NT 009782 by automated computational 
analysis using gene prediction method: GenomeScan, supported by EST 
evidence . 
Also see: 

Documentation of NCBI ' s Annotation Process 

Location/Qualifiers 
1. .1295 

/organism="Homo sapiens" 
/db_xref-"taxon: 9606" 
/chromosome= " 12 " 
1 . .1295 

/gene="LOC257238" 
/ db_xre f -" In t er imi D : 257238 " 

168. .1295 
/gene=''LOC257238" 
/codon_start=l 

/product^" similar to cortical granule serine protease 1 
precursor" 

/protein id^" XP 171629.1 " 

^nb xref- "GT : 220f^2?'^2 " 



TPGSAPPLAR,^CFRRMARILASDGYLLPFPSKPSWTLCPLSPQFWAQESEKKMSSDN 
QV;SADEDEGQLSRLIRKSRDSPFVPIGNATNILQDAEVHYISREMCNSERSYGGIIPN 

^s^cagd^dgafdtcrgdsggplmcylpeykrffv>:gitsyghgcgrrgfpgvyigps 



http://www.ncbi.nlm.nih.gov/cntrez/queryTcgi^cmd^Retncve& ^2/4/20^2 



/note=''Region: sinart00020, Tryp_SPc. Trypsin-like serine 
protease; Many of these are synthesised as inactive 
precursor zymogens that are cleaved during limited 
proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others 
are inactive due to substitutions of the catalytic triad 
residues" 
misc_f eature 171. .434 

/gene="LOC257238" 

/note= "Region: pfain00089, trypsin. Trypsin" 
misc_feature 819. .1199 

/gene="LOC257238" 

/note="Region: pfain02395, IGAl, Immunoglobulin Al 
protease. This family consists of immunoglobulin Al 
protease proteins. The immunoglobulin Al protease cleaves 
immunoglobulin IgA and is found in pathogenic bacteria 
such as Neisseria gonorrhoeae. Not all of the members of 
this family are TgA proteases (one member from K. rnl i 
cleaves human coagulation factor V, another one is a 
hemoglobin protease) " 
misc_feature 900.. 1187 

/gene="LOC257238" 

/note=" Region: smart00020, Tryp_SPc, Trypsin-like serine 
protease; Many of these are synthesised as inactive 
precursor zymogens that are cleaved during limited 
proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others 
are inactive due to substitutions of the catalytic triad 
residues" 
misc_feature 903.. 1187 

/gene="LOC257238" 

/note= "Region: pfam00089, trypsin. Trypsin" 
variation 681 

/gene="LOC257238" 

/allele="C" 

/allele="T" 

/ db_xre f = " dbSNP : 3742071 " 
BASE COUNT 352 a 265 c 314 g 364 t 

ORIGIN 

1 attgtggaac agcaccgctt aaggatgtgt tgcaagggtc tcggattata gggggcaccg 
61 aagcacaagc tggcgcatgg ccgtgggtgg tgagcctgca gattaaatat ggccgtgttc 
121 ttgttcatgt atgtggggga accctagtga gagagagcga tcctttaatg tggacagctg 
181 tgattggaac taataatata catggacgct atcctcatac caagaagata aaaattaaag 
241 caatcattat tcatccaaac ttcattttgg aatcttatgt aaatgatatt gcactttttc 
301 acttaaaaaa agcagtgagg tataatgact atattcagcc tatttgccta ccttttgatg 
361 ttttccaaat cctggacgga aacacaaagt gttttataag tggctgggga agaacaaaag 
421 aagaaggtat agcaggcttt gtgactgtgg tgtcctgtgg tctttacaag ctaaagtaca 
481 gaagagatca gaaaatgtca attcatctta ttcacatgag agttgctgcc caaggatttg 
541 ttgttggagc tgtgactcta gctcgaggct ttgcaggagg cgcacctgcg atggccttac 
601 cgccaggtga gcccggcggg ctctcatcgc ctcagccgga gattccaact gcagggagcg 



901 gcaacgctac aaatattuta caagatgc 
961 attctgagag gagttatggg ggaataat 
1021 atggagcttt tgatacttgc aggggtga 



ag aagcgcatta tacLLcLcga gagatgtgta 

tc ctaacacttc attttgtgca ggtgatgaag 

ca gtgggggacc attaatgtgc tacttaccag 

'-Tagt ta:^g-: a:v:^ggcLC" cgtcgaagaa: 



http;//w\vw.ncbi.nlnLnih.gov/cntre7yqucry.fcgi?cmd-Retneve&db-nucleotide&lisC 12/4/2002 



1261 ctttatgttt tgtcatctta ctagcaacaa cataa 

// 



Revised: July 5, 2002. 



Disclaimer [ Write to the Help Desk 
NCBI I NLM I NIH 



Dec 2 2002 13:45:47 



http:/Avw\v.ncbi.riIm.nih.gov/entrc7yqucryicgr?cmd=Rctncve&db~nucleotidcc^^ 12/4/2002 



>XM_208689 ACCESSION : XM„2 0868 9 NID: gi 30157117 ref XM_208689,3 
Homo sapiens similar to adrenal secretory serine 
protease precursor (LOC283471), mRNA 
Length = 1457 

Score = 712 bits (1819), Expect = 0.0 

Identities = 344/348 (98%), Positives - 345/348 (98%), Gaps = 3/348 (0%) 
Frame - +1 

Query: 1 MRLGLLSVAL-FVGSSHL-SDHYSPSGRHRLGPSPEPAASSQQAEAVRP:RLRRRREGGAH 58 

MRLGLLSVAL FVGSSHL SDHYSPSGRHRLGPSPEPAASSQQAEAVRKRLRRRREGGAH 
Sbjct: 40 MRLGLLSVALLFVGSSHLYSDHYSPSGRHRLGPSPEPAASSQQAEAVRKRLRRRREGGAH 219 

Query: 5 9 AKDCGTAPLKDVLQGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRERWVLTA 118 

A+DCGTAPLKDVLQGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRERWVLTA 
Sbict: 220 AEDCGTAPLKDVLOGSRIIGGTEAQAGAWPWWSLQIKYGRVLVHVCGGTLVRERWVLTA 399 

Query: 119 AHCTKD-SDPLMWTAVIGTNNIHGRYPHTKKIKIKAIIIHPNFILESYVNDIALFHLKKA 177 

AHCTKD sdplmwtavigtnnihgryphtkkip:ikaiiihpnfilesyvndialfhlkka 

Sbjct: 400 AHCTKDASDPLMWTAVIGTNNIHGRYPHTf'.KIKIKAIIIHPNFILESYVNDIALFHLKKA 579 

Query: 178 VRYNDYIQPICLPFDVFQILDGNTKCFISGWGRTKEEGNATNILQDAEVHYISREMCNSE 237 

VRYNDYIQPICLPFDVFQILDGNTKCFISGWGRTKEEGNATNILQDAEVHYISREMCNSE 
Sbjct: 580 VRYNDYIQPICLPFDVFQILDGNTKCFISGWGRTKEEGNATNILQDAEVHYISREMCNSE 759 

Query: 23 8 RSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLPEYKRFFVMGITSYGHGCGRRGFP 2 97 

RSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLPEYKRFFVMGITSYGHGCGRRGFP 
Sbjct: 760 RSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPLMCYLPEYKRFFVMGITSYGHGCGRRGFP 939 



Query: 298 GVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILIALCFVILLATT 345 

GVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILIALCFVILLATT 
Sbjct: 940 GVYIGPSFYQKWLTEHFFHASTQGILTINILRGQILIALCFVILLATT 1083 
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LOCUS LOC283471 1457 bp mRNA linear PRI 28-APR-2003 

DEFINITION Homo sapiens similar to adrenal secretory serine protease precursor 

{LOC283471) , mRNA. 
ACCESSION XM_2 0868 9 

VERSION XM_2085S9 . 3 GI ; 30157117 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo, 
1 (bases 1 to 1457) 

International Human Genome Sequencing Consortium. 
The DNA sequence of Homo sapiens 
Unpublished (2 003) 

GENOME ANNOTATION REFSEQ : This model reference sequence was 
predicted from NCBI contig NT_029419 by automated computational 
analysis using gene prediction method: BLAST, supported by mRNA and 
EST evidence . 
Also see: 

Documentation of NCBI ' s Annotation Process 

On Apr 28, 2003 this sequence version replaced gi : 29743 195 . 
Location/Qualifiers 
1. .1457 

/organism^ "Homo sapiens" 
/mol_type= "mRNA " 
/db„xref =" taxon : 9606" 
/chromosome- " 12 " 
1 . . 1457 

/gene-"LOC283471" 
/db_xref Inter imID : 2 8347 1 " 
40. .1086 

/gene="LOC283471" 
/codon_start=l 

/product="similar to adrenal secretory serine protease 
precursor" 

/protein_id="XP„208689 .3" 



FEATURES 

source 



gene 



CDS 



;^::r iLH:sY\^;DiALFH:-r:?:AVRVNDViQ?:cLPr dvfq:ldg::t?:cfisgv;grtk 

NATNILODAEVHYISRr:MCNSERSYGGIIPNTSFCAGDEDGAFDTCRGDSGGP:...MCYL 



http://www.ncbiJilmjiih.gc)v/cinrc/yqLieryJcei7cnul=Rclncve<t 3/()/2()()3 



/gene="LOC283471" 

/note="COG5640; Region: Secreted trypsin-like serine 
protease [ Post translat ional modification, protein 
turnover, chaperones] " 
/ db__xr e f = " CDD : COG 5 6 40 " 
misc_f eature 268 . . 978 

/gene="LOC283471" 

/note="Tryp_SPc; Region: Trypsin-like serine protease" 
/ db_xre f = " CDD : sma rtOOO 2^ " 
misc_f eature 271.. 978 

/gene="LOC283471" 

/note=" trypsin; Region: Trypsin" 

/ db_xre f = " CDD : pfa m0008 9 " 
BASE COUNT 419 a 269 c 327 g 442 t 

ORIGIN 

1 gggaagtacc tgccgccatc ttgctcacca gcctccaaaa tgcggctggg gctcctgagc 

61 gtggcgctgt tgtttgtggg gagctctcac ttatactcag accactactc gccctctgga 

121 aggcacaggc tcggcccctc gccggaaccg gcggctagtt cccagcaggc tgaggccgtc 

181 cgcaagaggc tccggcggcg gagggaggga ggggcgcatg cagaggattg tggaacagca 

241 ccgcttaagg atgtgttgca agggtctcgg attatagggg gcaccgaagc acaagctggc 

301 gcatggccgt gggtggtgag cctgcagatt aaatatggcc gtgttcttgt tcatgtatgt 

361 gggggaaccc tagtgagaga gaggtgggtc ctcacagctg cccactgcac taaagacgct 

421 agcgatcctt taatgtggac agctgtgatt ggaactaata atatacatgg acgctatcct 

481 cataccaaga agataaaaat taaagcaatc attattcatc caaacttcat tttggaatct 

541 tatgtaaatg atattgcact ttttcactta aaaaaagcag tgaggtataa tgactatatt 

601 cagcctattt gcctaccttt tgatgttttc caaatcctgg acggaaacac aaagtgtttt 

661 ataagtggct ggggaagaac aaaagaagaa ggtaacgcta caaatatttt acaagatgca 

721 gaagtgcatt atatttctcg agagatgtgt aattctgaga ggagttatgg gggaataatt 

781 cctaacactt cattttgtgc aggtgatgaa gatggagctt ttgatacttg caggggtgac 

841 agtgggggac cattaatgtg ctacttacca gaatataaaa gattttttgt aatgggaatt 

901 accagttacg gacatggctg tggtcgaaga ggttttcctg gtgtctatat tgggccatcc 

961 ttctaccaaa agtggctgac agagcatttc ttccatgcaa gcactcaagg catacttact 

1021 ataaatattt tacgtggcca gatcctcata gctttatgtt ttgtcatctt actagcaaca 

1081 acataaagaa attctgaagg ctttcatatc tttattttgc attgtgtccc tttctatgtt 

1141 ctatataatg aacatcattt attcttctag caattaattg cctacattag agatttcatg 

1201 tgaacatttt atgggctata agtattgtga cagatataca attgtaattt tggcactgaa 

1261 tcacatgtct ccttgaaata tcttgattat tttataatca taattctgta tctggaatac 

1321 tcatagagtt tgtacaaaat attcagttaa acatatattt tatgtgtata aatgccaaat 

1381 aatagtttat aattaaaatg aaagctgtca tttggttaaa ttaataaaaa ttctttctta 
1441 gattttattc taaaaaa 
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